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Abstract 

We introduce the following elementary scheduling problem. We are given a collection of n 
jobs, where each job Ji has an integer length li as well as a set Ti of time intervals in which it 
can be feasibly scheduled. Given a parameter B, the processor can schedule up to B jobs at a 
timeslot t so long as it is "active" at t. The goal is to schedule all the jobs in the fewest number of 
active timeslots. The machine consumes a fixed amount of energy per active timeslot, regardless 
of the number of jobs scheduled in that slot (as long as the number of jobs is non-zero). In other 
words, subject to l{ units of each job i being scheduled in its feasible region and at each slot at 
most B jobs being scheduled, we are interested in minimizing the total time during which the 
machine is active. We present a linear time algorithm for the case where jobs are unit length 
and each Ti is a single interval. For general Ti, we show that the problem is ./VP-complete even 
for B = 3. However when B — 2, we show that it can be efficiently solved. In addition, we 
consider a version of the problem where jobs have arbitrary lengths and can be preempted at 
any point in time. For general B 7 the problem can be solved by linear programming. For B = 2, 
the problem amounts to finding a triangle-free 2-matching on a special graph. We extend the 
algorithm of Babcnko ct. al. [5] to handle our variant, and also to handle non-unit length jobs. 
This yields an 0(y/Lm) time algorithm to solve the preemptive scheduling problem for B = 2, 
where L = We also show that for B = 2 and unit length jobs, the optimal non-preemptive 

schedule has < 4/3 times the active time of the optimal preemptive schedule; this bound extends 
to several versions of the problem when jobs have arbitrary length. 

1 Introduction 

Power management strategies have been widely studied in the scheduling literature [TJ [21 [4"T| [59| 
|4"0| 17]. Many of the models are motivated by the energy consumption of the processor. Consider, 
alternatively, the energy consumed by the operation of large storage systems. Data is stored in 
memory which may be turned on and off [3], and each task or job needs to access a subset of data 
items to run. At each time step, the scheduler can work on a group of at most B jobs. The only 
requirement is that the memory banks containing the required data from these jobs be turned on. 
The problem studied in this paper is the special case where all the data is in one memory bank. 
For even special cases involving multiple memory banks, the problem becomes AP-complete: if 
each job needs access to multiple memory banks in order to be satisfied, via a reduction from 
the fc-densest subgraph problem, it is AP-complete to determine whether there exists a schedule 
satisfying C jobs and being active for at most A units of time. 
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We propose a simple model for measuring energy usage on a parallel machine. Rather than 
focusing on conventional metrics measuring the quality of the schedule, we focus on problems 
motivated by energy savings in "efficient" schedules. 

In many applications, a job has many intervals of availability because, e.g., it interfaces with 
an external event such as a satellite reading or a recurring broadcast. The real-time and period 
scheduling literatures address problems in this space. More broadly, tasks may be constrained by 
user availability, introducing irregularity in the feasible intervals. Our model is defined generally 
enough to capture jobs of this nature. 

More formally, we are given a collection of n jobs, each job Jj having an integer length and a 
set Ti of time intervals with integer boundaries in which it can be feasibly scheduled. In particular, 
= i^k = [ r fc'^fc]}fc^i * s a non-empty set of disjoint intervals. Note that if mj = 1, then we can 
think of job J{ as having a single release time and a single deadline. For ease of notation, we may 
sometimes refer to job Jj as job i. In addition, time is divided into unit length timeslots and for 
a given parallelism parameter B, the system (or machine) can schedule up to B jobs in a single 
timeslot. If the machine schedules any jobs at timeslot t, we say that it is "active at t". The goal is 
to schedule all jobs, i.e. schedule them within their feasible regions, while minimizing the number 
of slots during which the machine is active. The machine consumes a fixed amount of energy per 
active slot. In other words, subject to each job Jj being scheduled within its feasible region Ti, 
and subject to at most B jobs being scheduled at any time, we would like to minimize the total 
active time spent scheduling the jobs. Note that there may be instances when there is no feasible 
schedule for all the jobs. However, this case is easy to detect. Note that for a timeslot significantly 
large (e.g. on the order of an hour), any overhead cost for starting a memory bank is negligible 
compared to the energy spent being "on" for that unit of time. 

To illustrate this model in other domains, consider the following operational problem. Suppose 
that a ship can carry up to B cargo containers from one port to another. Jobs have delivery 
requirements leading to release times and deadlines. Finding an optimal schedule corresponds to 
the minimum number of times we need to send the ship to deliver all the packages on time. The 
motivating assumption is that it costs roughly the same to send the ship, regardless of load and 
that there is an upper bound on the load. 

We could also consider this as a basic form of "batch" processing similar to the work initiated 
by Ikura and Gimple [39]. Their algorithm is designed to minimize completion time for batch 
processing on a single machine for the special case of agreeabl^ release times and deadlines. Baptiste 
[6] extended the Ikura and Gimple results to general release times and deadlines and an efficient 
algorithm was recently given by Condotta et. al.|15j. All of these works focus merely on trying to 
find a feasible schedule (which then can be used as a subroutine to minimize maximum lateness). 
However in our problem, in addition we wish to minimize the number of batches. 

In the scheduling literature, often problems with unit processing times are trivial since they can 
be solved using matching techniques. However, in different models which allow for overlap in job 
satisfaction, e.g. broadcast scheduling [22], [I2j [TT] , the problems often turn out to be iVP-complete; 
in fact, several variants of broadcast scheduling have been shown to be A^P-complete [llj . The 
problem considered in this paper also contains an element of "overlap" since we can schedule up to 
B jobs in a slot at unit cost and wish to minimize the number of active slots. 

For the cases of unit length jobs and those in which jobs can be preempted at integral time 
points, our scheduling problem can be modeled as a bipartite matching problem in which each 
node on the left needs to be matched with a node on the right. Each node on the right has a 
capacity of B, and we are interested in minimizing the number of nodes on the right that have 

1 When the ordering of jobs by release times is the same as the ordering of jobs by deadlines. 
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non-zero degree. This problem can easily be shown to be ATP-hard. Hence it is slightly surprising 
that for unit length jobs with each Tj being a single interval, we can develop a fast algorithm to 
obtain an optimal solution to the scheduling problem defined above^l Our algorithm is an almost 
greedy scheme, which intuitively abides by a lazy activation principle: schedule jobs in batches of 
size up to B delaying the batch as long as possible. At each step, we select "filler" jobs (with later 
deadlines) to fill slots which otherwise would have at least one and less than B jobs, based on an 
Earliest Deadline First (EDF) strategy. The algorithm as described does not quite work, since we 
may schedule some jobs using the lazy activation principle and later discover that these jobs should 
have been scheduled earlier to make space for other jobs with later deadlines. One way to address 
this problem is to dynamically re-assign jobs to time slots. Our first attempt was based on this 
idea, but it resulted in a slower algorithm with a more complicated analysis. However, we are able 
to address this issue by pre-processing the jobs to create a new instance with "adjusted" deadlines, 
so that at most B jobs have the same deadline. Then, no re-assignment of jobs is required. As we 
will see, for infeasible instances, this algorithm has the additional property that it will schedule the 
maximum number of jobs. 

In particular, as our paper shows, even the problem where B = 2 has a lot of structure due 
to its connection with matchings in graphs. We anticipate that this structure will be useful in 
the design of improved approximation algorithms for B > 2 (an O(logn) approximation algorithm 
follows from the work of Wolsey [57]). 
Main Results: 

1. For the case where jobs have unit length and each Tj is a single interval, we first develop an 
algorithm whose running time is O(nlogn). We then show how to improve its running time 
to linear. Our algorithm takes n jobs as input with integral release times and deadlines and 
outputs a schedule with the smallest number of active slots. The algorithm has the additional 
property that for infeasible instances, it schedules the maximum number of jobs. We also 
note that the slotted aspect of the time model is but a technical convenience. It can be shown 
without loss of generality that time is slotted when job lengths, release times and deadlines 
are integral (Section [2]). 

2. When the release times and deadlines are not integral, non-preemptively scheduling unit 
length jobs to minimize the number of batches can be solved optimally in polynomial time 
via dynamic programming (Section [3]). This objective differs from active time: a batch must 
start all its jobs at the same time and the system may work on at most one batch at a time. 
Even so, scheduling unit length jobs with integral release times and deadlines to minimize 
active time is clearly a special case of this. We extend the result to the case when we have a 
budget on the number of active slots (Section HJ. 

3. In addition, we consider the generalization to arbitrary Tj. This problem is closely related 
to vertex cover with hard capacities, the fc-center problem and capacitated facility location, 
all classic covering problems. In particular, for the special case where every job is feasible 
in exactly two timeslots, there is a LP-rounding 2-approximation, which is implied from the 
vertex cover result in [29]. The complexity of the problem depends on the value of B, since for 
any fixed B > 3, the problem is A^P-hard. When B = 2 this problem can be solved optimally 
in 0{rriy/n) time where m is the total number of time slots which are feasible for some job 
(Section [5}. We show that this problem is essentially equivalent to the maximum matching 

2 The problem can be solved in 0(n 2 T 2 (n + T) time using Dynamic Programming as was shown by Even et. al. 
[21] . albeit the complexity of their solution is high. Their algorithm solves the problem of stabbing a collection of 
horizontal intervals with the smallest number of vertical stabbers, each stabber having a bounded capacity. 
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problem computationally. In addition, we show that this algorithm can be extended to the 
case of non-unit length jobs when a job can be scheduled in unit sized pieces (Section [6]). 

4. We also consider a version of the problem when jobs have arbitrary lengths and can be 
preempted at any point in time, i.e. not just at integer time points. For general B the 
problem can be solved by linear programming. For B = 2 the problem amounts to finding 
a maximum triangle-free 2-matching on a special graph. Babenko et. al. present an elegant 
algorithm showing that a maximum cardinality triangle-free 2-matching can be found in the 
same time as a maximum cardinality matching [5]. We extend it for our scheduling problem 
to show that when B = 2 and jobs have arbitrary integral length, an optimal preemptive 
schedule can be found in 0(\^Lm) time, for L the total length of all jobs. Any preemptions 
occur at integral or half- integral times. 

5. In Section 7, we also give a tight bound on the gain from arbitrary preemption: an optimal 
schedule allowing only preemption at integral times uses at most 4/3 the active time of the 
optimal preemptive schedule. We also note that this bound is best possible since there is 
a trivial example with three unit jobs where the optimal schedule which allows preemption 
only at integer points uses two slots, and if we allow arbitrary preemptions, these jobs can be 
scheduled in 1.5 slots giving the ratio of 4/3. 

1.1 Related Work 

A classical problem related to our work is the well known "Scheduling unit jobs on B processors with 
precedence constraints", in which n unit jobs are given with precedence constraints and the goal is 
to schedule the jobs on B processors to minimize the maximum completion time. Again this can be 
viewed as minimizing the active time. For arbitrary B the problem is A^P-complete |34j . For fixed 
B, the problem is known to be W[2] hard [9]. For the case where B = 2, this problem can be solved 
optimally in polynomial time |25|l28j. Garey and Johnson [351 [36] consider the problem of scheduling 
unit jobs with integer release times and deadlines with precedence constraints, providing polynomial 
time algorithms for B = 2. In addition, their algorithm finds a schedule with minimum lateness. 
This was extended to the case of real release times and deadlines by Wu and Jaffar [58] , who gave an 
0(n 4 ) algorithm. Their primary technique involves computing successor-tree-consistent deadlines, 
which effectively upper bounds the latest completion time for each job. Successor-tree-consistency 
allows the optimal schedule to be computed via a slight variation to Simon's forward scheduling 
algorithm for independent unit length jobs. For scheduling unit length jobs with arbitrary release 
times and deadlines on B processors to minimize the sum of completion times, Simons and Warmuth 
[55| extended the work by Simons [53] giving & n algorithm with running time 0{n 2 B) to find a 
feasible solution. For constant B, the running time is improved in [48]. 

A closely related problem of minimizing busy time has been recently studied by Khandekar, 
Schieber, Shachnai and Tamir [44]. In the busy time problem, jobs of arbitrary length have indi- 
vidual demands r,-. The jobs have release times and deadlines and need to be scheduled in batches, 
with the additional requirement that the total demand of jobs in the batch at any point of time 
is at most a given value. Each batch is scheduled on a single machine. The busy time of a batch 
is defined as the busy time of the machine that schedules it, i.e. the duration of the earliest start 
time to latest end time of jobs in the batch. We highlight that their model permits access to an 
unbounded number of machines; thus every instance is feasible, albeit with potentially high cost. 
Our problem is slightly different in that we have uniform demands (as in [23]), and we do not 
have an unbounded number of machines. The non-unit length generalization makes the problem 
iVP-hard, even for the uniform demand case [231 156j . In [23], the authors consider the uniform 



4 



demand case and present a 4- approximation as well as results for the special case where the jobs 
are interval jobs, i.e. the processing time is exactly the length of the interval. In [33], they consider 
a more general problem and develop an approximation algorithm with a factor of 5. Their main 
idea is to first "cluster" the jobs with the assumption that each batch has infinite capacity and then 
fix this as the position of the job by modifying the release time and deadline, thus converting it to 
an interval job. The main algorithm then partitions the jobs by demand into two categories and 
uses a greedy method to schedule the jobs. A number of applications are mentioned in [44^ I23j . 

Baptiste [7] examines a related problem of "min gap" scheduling unit length jobs on a single 
processor to minimize the number of idle intervals; in this model, the algorithm determines when 
the processor sleeps. They give an optimal dynamic programming algorithm which builds from a 
dominance property of the optimal offline schedule. Baptiste, Chrobak and Diirr in [8] improve 
the running time; their algorithm in fact applies to the generalized problem in which jobs have 
arbitrary processing times. This work was subsequently extended to handle multiple processors 
by Demaine et. al. [18] , who also provide an approximation algorithm for the case where each 
job has multiple intervals in which it can be scheduled. They also give logn lower bounds on the 
approximation ratio. The cost function in this lower bound does not apply to the problems studied 
in this paper. 

Also related is the dynamic speed scaling problem, in which the scheduler determines the non- 
negative speed at which the processor runs. For a single processor, Yao et al. [59] give an exact 
offline solution that minimizes the total power consumption when the power is a convex function 
of the speed. Irani et al. |40j study an extended problem in which the machine can also be put 
into the "sleep" state, during which period no cost is incurred other than the constant wake-up 
cost. They present a 2-approximation and an 0(l)-competitive algorithm in the online setting. 
The problem was recently shown to be ./VP-complete and the approximation improved to 4/3 by 
Albers and Antoniadis [3]. Despite the significant results in [3D], its authors acknowledge that a 
continuous power function is unrealistic; in practice, systems run at a finite number of potential 
speeds. Our work is the special case in which power is represented by a step function. 

Li and Yao [47] consider a discretized version of the problem, in which the system may operate 
at one of a finite number of speeds. Their algorithm is exact and runs in time O(dnlogn), where d 
is the number of possible speeds. The main idea is to first partition the jobs, and then to determine 
the speeds of these jobs, partition by partition. However, their model is not quite the same as 
ours; despite the discretization of the speeds, they still assume that the underlying power function 
is convex and therefore cannot capture the step from speed to positive speed. 

Demaine et. al. |18] investigated problems involving multiple processors and more generally, 
multiple feasible intervals for jobs. For the multiple processors settings, they provide a polynomial- 
time algorithm which minimizes the total number of gaps in the schedule. The algorithm also 
minimizes the total transition energy plus total time in active slots, over multiple processors. Notice 
that this setting is not quite a generalization of ours, since the total active time is summed over 
each processor. Unlike our cost model, it is cheaper to activate fewer rather than more processors 
at any given time. Finally, they give for the multi- interval setting a (1 + (| + e)a)-approximation, 
where a is the cost to transition to the active state. Notice that their cost model is very closely 
related to ours, with the exception that we consider settings in which the startup cost is negligible 
enough to be assumed zero. Thus, their lower bounds on the approximation ratio, which explicitly 
assume a non-zero a, do not apply to our problem. 

Demaine and Zadimoghaddam [19] recently studied the problem of minimizing energy consump- 
tion in schedules over multiple processors. Their model is quite general in that the feasible time 
slots in which each unit length job may be scheduled may not comprise a single time interval. Also, 
each processor has an arbitrary (unrelated) energy function and can (but doesn't have to) go to 
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a sleep state. They provide a 0(log n)-approximation for this problem by first proving a general 
result for the submodular maximization problem with budget constraints, and then reducing their 
scheduling problem to a matching problem on a bipartite graph with a submodular matching func- 
tion. They also show that the problem is Set-Cover-hard, thus demonstrating the tightness of their 
result. 

We refer the reader to surveys [411 [TJ [2] for a more comprehensive overview of the latest schedul- 
ing results for power management problems. 

2 Lazy Activation for Unit Jobs and Single Execution Windows 

We first provide a high level description of the algorithm, followed by pseudo-code and the proof 
of optimality. We may assume that the instance is feasible, since this is easy to check by an EDF 
computation. 

Denote the distinct deadlines by d^ < di 2 < . . . < di k , and let S p be the set of jobs with 
deadline di p . Then S\ U S2 ■ ■ ■ U Sk is the entire set of jobs where the deadlines range from time 1 
to T = di k . It is not hard to argue that w.l.o.g. T is 0(n). 

We process the jobs in two phases. In Phase I we scan the jobs in order of decreasing deadline. 
We do not schedule any jobs, but only modify the deadlines of jobs to create a new instance, 
whose optimal solution is equivalent to that of the original instance. The desired property of the 
new instance is that at most B jobs will have the same deadline. Process the time slots from 
right to left. At slot D, let S be the set of jobs that currently have deadline D. From S, select 
max(0, \S\ — B) jobs with earliest release times and decrement their deadlines by one. If |,S| < B 
then we do not modify the deadlines of jobs in S. (Note that a job may have its deadline reduced 
multiple times since it may be processed repeatedly.) 

Assume for simplicity's sake that after the first phase, S p refers to the jobs of (modified) 
deadline di . We now describe Phase II in which jobs are actually scheduled. Initially all jobs are 
unscheduled. As the algorithm assigns jobs to active time slots, we change the status of jobs to 
scheduled. Once a job is scheduled, it remains scheduled. Once a slot is declared active, it will 
remain active for the entire duration of the algorithm. 

Our algorithm, in general, schedules jobs in increasing order by deadline. As we will see shortly, 
this is not quite true since we may schedule some jobs with later deadlines whenever there is 
available space. In precise terms, we schedule the time slots di p from left to right. To schedule 
di , if there are still unscheduled jobs with that deadline we schedule them. If fewer than B jobs 
get scheduled, we schedule additional jobs that are available. Job j is available if it is currently 
unscheduled and has di p G [rj,dj). We schedule these available jobs EDF, until the slot is full or 
no more jobs are available. 

2.1 Formal Algorithm Description 

Let B > be the number of jobs that the system can satisfy in a single time slot. For every job 
j, denote j's release time and deadline by rj and dj, respectively. We index the n jobs in order of 
increasing deadline, i.e. such that d\ < di < . . . < d n = T. Normalize to the earliest release time 
of any job. Then without loss, all feasible schedules are active only within the interval [0, T]. 

In the first phase we scan the jobs from right to left in decreasing deadline order. At each step 
we consider the set of jobs with a common deadline and leave up to B jobs with the latest release 
times untouched. We modify the deadlines of the rest of the jobs in this set, decrementing them 
each by one, and then continue processing the jobs. 
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The algorithm for the second phase simultaneously maintains a set W of active time slots and a 
set J of satisfied jobs, both of which are initially empty. In each iteration, we look at the unsatisfied 
job j* of earliest deadline, i.e. j* = arg min^j dj. Let d* be j*'s deadline, and let J* be the set 
of all unsatisfied jobs with the same deadline d* . The algorithm activates the latest possible time 
slot which can satisfy J* and adds it to W. Only one slot is needed to satisfy J* since \J*\ < B. 

The algorithm assigns jobs to the newly activated time slot t first adding J* to J and assigning 
those jobs to slot t. If fewer than B jobs are available, we fill the remaining space by selecting 
available (filler) jobs from the remaining set of unscheduled jobs, again selecting based on EDF. 
These filler jobs will then be added to J. 

The following pseudocode formalizes the above description of the second phase. SelectFillers 
takes as input the set of active slots W and the set of scheduled jobs J, returning the set of filler 
jobs J'. These jobs are then added to the set of scheduled jobs. 



Algorithm 1: Lazy Activation Algorithm 

J^0; W^Q ; 
while 3 j ^ J do 

d* <— argmhx^j dj ; 
J*^{j$J: dj = d*} ; 
W <- W U d* ; 
J <- J U J* ; 

J' <- SelectFillers(W, J) ; 
J <- J U J' ; 



Algorithm 2: SelectFillersfW, J) 
Choose available fillers based on EDF to fill the B — \ J*\ empty spots. 



2.2 Analysis of the Algorithm 

It is easy to implement our algorithm in running time 0(n log n) using standard data structures. 
What is not completely obvious is why it computes an optimal solution. 

Suppose the initial instance I is transformed by Phase I to a modified instance I'. We prove 
the following properties about an optimal solution for I'. 

Proposition 1 An optimal solution for I' has the same number of active slots as an optimal 
solution for the original instance I. 

Proof: It is easy to see that any feasible solution for I' is feasible for / since pre-processing only 
created a more constrained instance in the transformation; each job's window in i 7 is a subset of 
its window in the original instance /. We now argue that a solution for / can be transformed to 
a solution for /' using the same number of slots. Suppose a feasible schedule a for / is infeasible 
for I' due to a job x. In other words, a schedules job x after its modified deadline in We can 
argue this step by step, by showing that decrementing the deadline of a single job does not change 
the optimal solution; since the modification is done by a sequence of such operations, the optimal 
solution is preserved. Assume that the deadline of x was reduced by one. In the instance I', we 
have B jobs with deadline d x , out of which at most B — 1 jobs can be scheduled with x. Hence 
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there is at least one job scheduled earlier whose deadline is still d x . Since its release time cannot 
be before the release time of x, we can exchange these two jobs. This makes the schedule feasible 
for I', and this establishes the proposition. □ 



Proposition 2 Without loss of generality an optimal solution for I' uses a subset of slots that are 
deadlines. 

Proof: Among all optimal solutions select the one that uses the least number of non-deadline slots. 
Among all the slots that are not deadlines, choose the slot t that is the right-most (i.e. latest) active 
slot. Let X be the set of jobs which are assigned to t. We now move X as late as possible while 
maintaining feasibility for every job in X. We will end up next to a deadline slot t' containing a set 
of blocking jobs B; otherwise we will have reduced the number of non-deadline slots. From the set 
of jobs X U B, select B jobs with the earliest deadlines and schedule them in t'. Since at most B 
jobs have deadline t' , the remaining jobs all have higher deadlines and the process can be repeated. 

□ 

Theorem 3 Algorithm Lazy Activation computes an optimal solution for V (with the smallest 
number of active slots). 

Proof: It is enough to prove that the optimal solution w.l.o.g. schedules exactly the same jobs that 
we schedule in the first active slot (earliest deadline d±). By removing these jobs from the instance 
I', it is easy to see that our algorithm computes an optimal solution. The proof for the claim is as 
follows. Due to the above propositions, the jobs with deadline d\ w.l.o.g. are all scheduled in time 
slot d\ in the optimal solution; Algorithm Lazy Activation also schedules all the jobs in this time 
slot. In addition, observe that to fill the remaining slots among the available slots, the optimal 
solution again w.l.o.g. selects available jobs with earliest deadline, i.e. the filler jobs chosen by 
Algorithm Lazy Activation. Otherwise we can exchange the jobs to achieve this property. The 
proof follows by induction. □ 

2.3 On Infeasible Instances 

In this section, we consider the behavior of the Lazy Activation algorithm on instances for which it 
is impossible to schedule all jobs within their individual windows of feasibility. We will show that 
Lazy Activation maximizes the number of jobs satisfied. In fact, we will see that it does so in the 
fewest number of active timeslots. Denote by Sla the schedule returned by Lazy Activation. In 
Phase I, it is possible for a job's deadline to be decremented all the way to its release time; in this 
case, we say that the job's window has collapsed. 

Proposition 4 On infeasible instances, Lazy Activation maximizes the number of jobs satisfied. 

Proof: One can use an argument similar to Proposition 1 to show that even if Phase I collapses 
the windows of some jobs, it does not change the maximum number of jobs that can be scheduled. 
For completeness' sake, we detail it here. As before, we argue step-by-step that each decrement 
changing the instance from / to I 1 does not affect the maximum throughput. Suppose a deadline 
d x of job x is reduced by one. Let a be a feasible schedule on / achieving maximum throughput 
on /. We can transform a into a' that is feasible on instance I 1 and that satisfies the same number 
of jobs. If o does not schedule job s at 4, then a is already feasible on I'. Suppose a schedules 
x at d x . Then a can do at most B — 1 other jobs at d x . In I', there are B jobs with deadline d x 
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and release time > r x . Thus, there exists a job j which is not scheduled by a at d x and which has 
release time at least r x . Modify a by swapping jobs j and x. (If j was not scheduled in a, then 
schedule j and not x.) The new schedule satisfies the same number of jobs and is also feasible for 

Thus, the modification of deadlines in Phase I does not change the maximum number of jobs 
which can be scheduled. In particular, jobs whose windows collapse in Phase I w.l.o.g. are also 
dropped in some throughput-maximizing schedule. In Phase II, Lazy Activation schedules every 
job whose window has not collapse, either at its deadline or earlier, i.e. as a filler. Therefore, Lazy 
Activation maximizes the number of jobs satisfied. □ 

Proposition 5 // Lazy Activation's schedule Sla satisfies n' < n jobs in k active slots, then any 
schedule S satisfying n' jobs does so in at least k active time slots. 

Proof: Suppose that Lazy Activation collapses k = n — n' jobs, denoted j ai . . . j aK . Let at be the 
number of jobs that had deadline t at the start of the iteration in which Phase I processed t as 
the deadline. Deadlines will be decremented precisely when at > B. For each collapsed job j ai , 
we will identify an interval I ai of excess demand by intuitively "unrolling" the iterations of Phase 
I starting from the point of collapse. More formally, set t' to the latest slot such that every slot 
t E is such that at > B. Define I ai to be [rj,t'). Notice that t' is the original deadline of 

some job. Let J ai = {j : [rj,dj) C I a .}. Then the collapsed job j ai G J ai and also |J 0j | > B ■ \I ai \- 
In fact, J ai consists exactly of two types of jobs: jobs which have collapsed and B ■ \I ai \ jobs which 
have not collapsed. Lazy Activation schedules the latter job set in I ai at B jobs per slot. 

Now partition the original instance (J, T) into two subinstances (J\,T\) and (J2,T 2 ), where 
J\ = Ui=i and T\ = Ui=i-W Obviously jobs of J\ cannot be scheduled in slots of T2 by 
definition. We claim that since S maximizes throughput, it necessarily schedules J2 in T2. Suppose 
there exists a job j G J% that is scheduled by S in some interval I ai . Then since \ J ai \ > B ■ \I ai \, 
there is some job of J ai that is missed by S. Since it is possible to schedule all jobs of J2 only in slots 
of T2 (Sla is such an example), missing that many jobs of J ai was unnecessary. This contradicts 
the fact that S maximizes throughput. 

Then the active time A(S) of S (Sla, respectively) can be decomposed into two components: 
the active time A\(S) spent satisfying J\ and the active time spent satisfying J2. Then, 

A(S) = A 1 (S) + A 2 (S) 

> A^Sla) + A 2 (S LA ) 
= A(S LA ) 
= k 

where the inequality follows from the facts that (1) Lazy Activation minimizes active time on feasible 
instances and (2) there are exactly B ■ \I ai \ jobs contained in each J 0j that have not collapsed and 
Lazy Activation schedules all of them. Thus whenever Lazy Activation is active in T2, it schedules 
B jobs per slot. □ 

2.4 Linear Time Implementation 

We conclude by showing that the algorithm can be implemented in time 0(n + T). We start by 
giving the following equivalent version of Phase I, which we refer to as Phase I'. Note that in 
"assigning" jobs to deadlines, both Phases are only preprocessing deadlines. The actual schedule 
is not developed until Phase II. 
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Initially each new deadline value has no jobs assigned to it. Process the jobs j in order of 
decreasing release time rj (jobs with the same release time can be processed in arbitrary order). 
Assign j to a new deadline equal to the largest value less than or equal to dj that currently has 
< B jobs assigned to it. 

To prove that Phase I' computes the same deadlines as Phase I, assume that both algorithms 
break ties for release times the same way. Let Dj and D'- denote the deadlines to which job j 
is assigned by Phase I and Phase I', respectively. We show that for all j, D'- is equal to Dj, 
by induction on Phase I"s iterations. Clearly this holds for the first iteration, since the first job 
considered has maximum release time, so Phase I never decrements its deadline. Therefore, both 
Phases assign it to its original deadline. Now consider some job j and suppose that equality holds 
for all jobs processed it iterations previous to that of job j. Then Dj < D'y Dj is an available 
candidate deadline for Phase I', which chooses the latest such one. Thus, when Phase I assigns jobs 
to deadline Dj, it considers assigning job j (feasible and unassigned) there as well. In fact, Phase 
I assigns to D'- the jobs that Phase I' has already assigned there. There are less than B of them, 
each having release time at least Tj. Among the remaining jobs which are feasible at Dj, job j has 
the maximum release time so Phase I assigns j to D'-. Therefore, Dj = D'-, as claimed. 

J •* J 

We implement Phase I' using an algorithm for disjoint set merging on the universe of possible 

deadline values 1 T. Each set consists of a deadline D to which < B jobs have been assigned, 

plus the maximal set of consecutive values D + 1, . . . that each have B assigned jobs. It is easy to 
see that using the set merging data structure of [33J achieves time 0{n + T) for Phase I'. 

Phase II constructs an earliest-deadline first schedule on the timeslots that are Phase I deadlines. 
In particular, let D\ < D2 < ■ ■ ■ < Dg be the distinct deadline values assigned to jobs in Phase I. 
Modify release times and deadlines to compress the active interval from [0,T] to [0,£], as follows: a 
job j with Phase I deadline Dk gets deadline k; its release time rj gets changed to the largest integer 
i such that D{ < rj (take Dq = 0). The algorithm computes an earliest-deadline first schedule on 
this new instance. It returns the corresponding decompressed schedule, with slot i changed back 
to A- 

Phase II begins by assigning the new release times in time 0(n + T). Then it constructs an 
EDF schedule using disjoint set merging as in Phase I' [21], achieving time 0{n + T) |33j. 

2.5 Slotted versus Unslotted Model 

Here we show that even if the time model is not slotted, the fact that the release times and deadlines 
are integral implies that without loss, the optimal solution schedules jobs in slots implied by integer 
time boundaries. 

Theorem 6 Without loss of generality, the optimal solution schedules jobs so that they start and 
end at integral time points. 

Proof: Call a job non-integral if it starts (and ends) at a non-integral time. Suppose an optimal 
solution O has non-integral jobs. We modify O to another optimal schedule A4 with fewer such 
jobs. Doing this repeatedly eliminates all non-integral jobs. 

Let I be the total inactive time of O's schedule, i.e., an optimal schedule maximizes /. In O, 
let [t, t + 1) be the last interval (for t an integer) in which a non-integral job ends. 

Case 1: Some processor is busy throughout the interval [t,t + 1) (e.g., for unit jobs this means 
some job starts at t). Construct A4 by delaying every non-integral job which ends in this slot so 
that it ends at t + 1 instead. I does not decrease, since every job gets moved into a time period 
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that was already active in O. Clearly, the number of non-integral jobs decreases. 

Case 2: No processor is busy during the entire interval [t,t + 1). Let i be the largest integer 
< t such that an interval of positive length [i,i') is inactive (i 1 > i). If no such interval exists, let 
i = 0. 

Construct Ai by taking every non-integral job that starts after i and advancing its start time 
by e = min{s — \_s\ : s > i the starting time of a non-integral job}. Obviously < e < 1. Observe 
that the definition of i implies than a non-integral jobs advances if and only if it ends after i. 

Ai is a valid schedule: the choice of e guarantees that no release time is violated. We claim 
that no processor executes more than one job at any point in time, i.e. no job j advances into the 
execution of a job j' that does not advance. In proof, if j' is integral, then it ends at an integral 
time. The definition of e shows that j does not advance past an integral time. If f is non-integral, 
then it ends at or before i. Again, the definition of e implies that j does not advance past integer 
time i. 

To show that Ai is optimal, we will show that / does not change. Specifically, we show that I 
increases by e at the end of the schedule, / does not change in the middle, and / decreases by e at 
the start. 

In interval [t,t + 1), every job advances by e (by Case 2), so I increases by e in this interval. 

Next, consider a maximal interval [a, b) that is inactive in O and starts at a > i. We claim that 
Ai is inactive during a corresponding interval [a' , b') of the same or greater length. The definition 
of i implies that a is non-integral, i.e., a is the ending time of a non-integral job. This ending time 
moves to a' = a — e. If b is the start time of a non-integral job, that start moves to b' = b — e. 
Clearly, [a', b') is inactive in Ai . The other possibility is that b starts an integral job. Setting b' = b 
gives an inactive interval of greater length. 

If i = 0, we have shown that Ai has more inactive time than O, a contradiction. If i > 0, the 
interval [i,i') shrinks to an interval of inactive time [i,i' — e) in Ai. This decreases I by e. We 
conclude that Ai is an optimal schedule. The definition of e implies that Ai has more integral jobs 
than O. □ 



3 Unit Jobs with Arbitrary Release Times and Deadlines 

In this section, we consider a generalization of the previously discussed problem. Suppose that the 
unit length jobs have release times and deadlines over the reals and that we want to minimize the 
number of batches in the schedule, where a batch is a set of jobs which all start and finish at the 
same time and where the system can work on at most one batch at a time. This objective is slightly 
more restrictive than active time; even so, scheduling unit length jobs with integer release times 
and deadlines to minimize active time is clearly a special case of this, since in the former problem, 
the batch property holds without loss. 

We describe a simple dynamic program which determines in 0(n 8 ) time the minimum number 
of batches needed to non-preemptively satisfy all jobs, i.e. l|p-batch, B < n,ri,di,pi = l\K, where 
K is the number of batches in the schedule. The dynamic program (and therefore, the notation) 
is similar to that found in [6], on which several DP results in this area of scheduling are based. 
Suppose throughout that jobs are listed in ascending order by deadline, i.e. d\ < di < . . . < d n . 

Definition 1 For job k and for interval [te,t r ] where r/% £ [t£,t r ] Q-nd t r + 1 < dt, let the set of 
jobs Uk{te,t r ) = {j < k : r,j G [t£,i r ]} (see Fig. [1]). Also let Uo(t£,t r ) = for all intervals [tg,t r ]. 
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Figure 1: Jobs of Uk(t£,t r ). 



Observe that the jobs in Uk(te, t r ) must be scheduled entirely in [tj>, d^] in any feasible schedule. 
We restrict ourselves without loss to the space of schedules which obey the EDF Principle: for any 
pair of jobs i and j where i < j, either job i is scheduled before or with job j, or rj is after the 
point at which job j is scheduled. Indeed, if a schedule contains a pair of jobs i and j for which this 
does not hold, then one can swap them in the schedule without affecting feasibility or the number 
of batches. 

Definition 2 Let P(k,t£,t r , fi r ) be the minimum number of batches required to schedule Uk(te,t r ) 
such that (1) all batches start within [tg + l,t r ] and (2) at most B — fj, r of these jobs are scheduled 
in the batch starting at t r , and if fJ, r > 0, this coming at no additional cost. 

If such a schedule does not exist, e.g. k > and + 1 > t r , then let P(k,te,t r , fi r ) = oo. Let 
Pt(k, tg,t r , fi r ) be the value of P(k, te, t r , fi r ) subject to job k being scheduled in a batch that starts 
at time t. Then 



Per Baptiste's observation in [7], it is enough to iterate over a set of 0(n 2 ) possible starting times 
for batches. For a given t, one can compute Pt(k,ti,t r , fj, r ) as follows. Let L = {j < k : rj G [t£,t]} 
be the jobs of Uk{te,t r ) which are released before or at t. Similarly, let R = {j < k : rj G (Mr]} 
be the jobs of Uk(t£,t r ) released after t. The observation above implies that if i G L, then i is 
scheduled before or at t. On the other hand, if i € R, then job i must be scheduled after t. Let 
kh G arg max igi \{ fc } di, or if L\{k} = 0. Similarly, let ku G argmax^g^ dj, or if R = 0. 



If t = t r , then L = Uk(t£,t r ), R is empty and k^ (if positive) is the second-to-latest deadline in 
U k (t£,t r ). Then, 



Consider t < t r . If fj, r > 0, then for t G (t r — l,t r ), Pt(k,t£,t r , fj, r ) = oo, since starting batches at 
t and at t r will violate the constraint that at any given time, there is at most one batch running. 
Otherwise, i.e. if t < t r — 1 or fj, r = 0, 



P(k,t£,t r ,fi r ) 



minP t (fc, te, t r ,n r ) 




if < fi r < B 
if fj, r = 
otherwise 



(1) 



P t (k,t£,t r ,fj, r ) = 1 + P(k L ,t£,t, 1) + P(k R ,t,t r ,Hr) 
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Finding the optimal number of batches is equivalent to computing P(n, minj r, — p,d n — 1, 0). 
Since there are O(n) jobs and 0{n 2 ) interesting times to consider, the total running time is 
0(n B) = 0(n 8 ). The running time was recently improved to 0(n 3 ) in |42j . 

4 Maximizing Throughput Subject to Batch Size Constraint 

For infeasible instances, one can apply a technique of the same vein to maximize throughput subject 
to an upper bound /C on the number of batches. Define P(k, tg, t r , /i r , k) to be the maximum number 
of jobs of Uk(t£,t r ) which can be scheduled in at most k batches starting in [tg + l,t r ], where as 
before, if [i r > 0, one can schedule up to B — /j, r jobs in the batch starting at t r without incurring 
cost toward the batch budget k. 

To understand the computation of P(k, tg, t r , fi r , k), let t be the starting time of the batch which 
satisfies job k (if such a time exists). Consider where the rest of Uk{tg,t r ) may be satisfied within 
[tg + l,t r ]. Denote by a (/3, respectively) the number of them which start in [tg + l,t] ((t,t r ], 
respectively). Observe that f3 < k — a. 

There are four cases. Suppose job k is not satisfied in the optimal solution. Then P{k, tg, t r , fi r , k) = 
P(k' , tg, t r , fj, r , k) where k' = argmax£ 6 £/ fe ( t<)tr \\ r k \ di, i.e. the job of next latest deadline in Uk{tg, t r ). 
(If Uk(ti, t r ) = 0, then k! = 0.) If job k is satisfied in the schedule, then define L, R, k^ and kn as 
in Section 3. Suppose that job k is satisfied in the batch starting at t = t r . Then 

r l + P(k L ,ti,t r ,l,K-l) if/i r = 
P(k,ti,t r , pb r , k) = < 1 + P(ki, tg, t r , fi r + 1, k) if < [i r < B (2) 
[ — oo otherwise 

If job k is satisfied in a batch which starts at t < t r — 1, or if fx r = and t < t r , then let a be 
the number of batches used to satisfy jobs in L; this includes the batch satisfying job k, so a > 0. 
Then (3 = k — a is the budget on the number of batches used to satisfy R. Note that for the case 
where \i r = 0, if R is not empty, then any a corresponding to a feasible schedule will be strictly 
less than k, i.e. f3 > 0. 

{max <Q,< K (l + P(k L ,tg, t, 1, a - 1) if < fi r < B 
+P(k R ,t,t r ,/j, r ,K - a)) (3) 
P(k, tg, t r — 1, 0, k) otherwise 

Finally, if [i r > and t G (t r — l,t r ), then P(k,tg,t r , fi r , k) = — oo, since the machine cannot 
work on multiple batches at the same time. P(k,ti,t r , fi r , k) is the maximum over these values. 
Since there are 0(n) possible values for k, 0(n 2 ) possible values for both t r and tg, and 0(n) 
possible values for fj, r , and since k = 0(n), there are 0(n ) possible values of P(k, tg, t r , fj, r , k) that 
need to be computed. Computing each one costs O(n), yielding a total running time of 0(n 7 K,). 
We note that if the time model is slotted, then there are 0(n) possible values for both t r and tg, 
and the total running time is instead 0(n 5 /C). This algorithm can be extended to the case in which 
each job has an associated profit and the goal is to schedule a profit-maximizing set of jobs 
in K. slots. 

5 Disjoint Collection of Windows 

In this section, we discuss the more general problem in which each job's feasible regions Tj are 
arbitrary (as opposed to being a single interval). For any fixed B > 3, we show that the problem 
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is ./VP-hard and discuss its relationship to other classic covering problems. We also develop an 
efficient algorithm for the two processor case {B = 2). 

5.1 Connections to Other Problems 

Capacitated Vertex Cover. There is an interesting relationship between the activation problem 
of unit-length jobs with multiple feasible windows and the problem of vertex cover with capacity 
constraints. In the latter problem, we are given a graph G = (V, E) and a capacity function k(v) 
for each vertex. The goal is to pick a subset S of vertices and to determine an assignment of edges 
to vertices in S, so that each edge is assigned to an incident vertex in S, and so that each vertex v 
has at most k{v) adjacent edges assigned to it. When multiple copies of a vertex may be chosen to 
be part of the cover, a primal-dual 2-approximation is given in [30]. In the hard capacities version 
of the problem (VCHC), a bounded number of copies of a vertex may be selected. There is a 
2-approximation using LP rounding [29J, improving the previous bound of 3 given in |14j . 

In the special case of the activation problem in which each job has exactly two feasible time 
slots, there is an equivalence with VCHC with uniform capacities k{v) = B. Since jobs need not 
have adjacent feasible time slots, one can view time as a set of slots rather than an ordering of 
slots. Then, the equivalence follows: the slots are the vertices and the jobs are the edges. For any 
valid capacity-respecting vertex cover of size C, there exists a corresponding activation schedule of 
cost C (and vice- versa). One implication of our result is that we can solve VCHC optimally when 
k(v) = 2. Furthermore, the 2-approximation for VCHC in [29] applies to the activation problem 
when each job has two feasible slots. There is a similar relationship between VCHC on hypergraphs 
and the activation problem where jobs may have more than 2 feasible slots, and for size g hyper 
edges, an 0(g) approximation has recently been developed [52] . 

Capacitated K-center. Another previously studied problem is the .fT-center problem with load 
capacity constraints |10|I43]. Given an edge weighted graph satisfying the metric property, the goal 
is to pick K nodes (called centers) and assign each vertex to a chosen center that is "close" to it. 
We should not assign more than L nodes to any chosen center and want to minimize the value <i max 
such that each node is assigned to a center within distance d max of it. Previous work culminated 
in a 5-approximation for the problem where multiple copies of the same node may be chosen as a 
center. If only one copy can be chosen then the bound goes up to 6. 

There is the following correspondence between this and the following instance of the activation 
problem. First, guess the maximum value d max and create the (unweighted) graph G dmax induced 
by edges having weight at most d max . Then create a job Jj and a timeslot fj for each node i in the 
original graph G, and let job Jj be feasible in slot tj if and only if node i is within distance d max 
of node (or center) j in G, i.e. if edge exists in G max . (So U should be a feasible slot for Jj.) 
Then, finding K centers to which all n vertices can be assigned in G max is equivalent to opening 
K timeslots and feasibly scheduling all n jobs in them so that no slot is assigned more than B jobs, 
where B = L. Since we can solve the activation problem when B = 2, we can optimally solve the 
-RT-center problem when L = 2. 

Capacitated Facility Location. Given a set of facilities (with capacities) and clients, the goal is 
to open a subset of facilities and find a capacity-respecting assignment of clients to open facilities 
that minimizes the sum of facility opening costs and connection costs. See \13\ |4"6] for approxima- 
tion algorithms for this problem. Here, we show that when capacities are uniformly 2, the problem 
can be solved optimally in polynomial time. For clarity's sake, the details of the connection are 
described at the end of Section 15.31 
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5.2 Proof of iVP-hardness for B=3 



We prove that the activation problem for jobs of arbitrary Tj is ./VP-hard when B = 3 via a 
reduction from 3 EXACT COVER, which is known to be iVP-hard [M]. Given a collection X of n 
elements and a collection of subsets Si, ... , S m , each containing exactly three elements from X. Is 
there a sub-collection of exactly ^ subsets that exactly cover all of X? We can view this problem as 
a bipartite graph where one side has the elements X and the other side has a vertex for each subset 
Si, and edges denote membership. This problem maps to the question of finding a dominating 
set of size exactly ^ where we are only allowed to select from the side containing subsets. The 
relationship with the scheduling problem is now obvious: selecting a subset is akin to activating a 
certain time slot where the set X corresponds to a collection of jobs. The edges specify in which 
time slots a job can be scheduled. Since the subsets have size exactly 3, there is an exact cover of 
size ^ if and only if there is a schedule with exactly ^ active slots. 

We observe that there is an O(logn) approximation for this problem (for any B) by an easy 
reduction to a submodular cover problem. This result follows from the classical covering results 
due to Wolsey [57]. 

5.3 Polynomial solution for B=2 

This subsection considers the problem of scheduling jobs in the fewest number of active slots, when 
there are two processors and each job can be scheduled in a specified subset of time slots. We use 
the following graph G. The vertex set is JUT, where J is the set of all jobs and T the set of all 
time slots. The edge set is 

{(j, t) : job j can be scheduled in time slot t} U {(t, t) : t G T}. 

A degree-constrained subgraph (DCS) problem is defined on G by the degree constraints d{j) < 1 
for every j £ J and d(t) < 2 for every t E T. By definition a loop (t, t) contributes two to the 
degree of t. 

Any DCS gives a schedule, and any schedule gives a DCS that contains a loop in every nonactive 
time slot. Also any DCS D that covers t jobs and contains A loops has cardinality 

\D\ = t + X. (4) 

Lemma 1 A maximum cardinality DCS D of G minimizes the number of active slots used by any 
schedule of \ V(D) n J\ jobs. 

Proof: Let D be a maximum cardinality DCS. D covers i = \ V(D)f] J\ jobs, (jl]) shows no schedule 
of l jobs contains more loops that D. D contains the loop (t, t) precisely when t is not active. Thus 
D minimizes the number of active slots for schedules of t jobs. □ 

To state the time bounds of this section, let n be the number of jobs and m the number of given 
pairs (j,t) where job j can be scheduled at time t. Observe that G has 0(m) vertices and 0(m) 
edges, since we can assume every time slot t is on some edge (j,t). In any graph, the majority of 
edges in a simple non-trivial path are not loops. So in the algorithms for maximum cardinality DCS 
and maximum matching, an augmenting path has length 0(n). A maximum cardinality DCS on 
G can be found in time 0{y/nm). The approach is through non-bipartite matching. We describe 
two ways to handle the loops. 

The first approach reduces the problem to maximum cardinality matching in a graph called the 
MG graph. To do this modify G, replacing each time slot vertex t by two vertices ii,t2- Replace 



15 



each edge (j,t) by two edges from j to the two replacement vertices for t. Finally replace each loop 
(t, t) by an edge joining the two corresponding replacement vertices. 

A DCS D corresponds to a matching M in a natural way: If a time slot t is active in D then 
M matches corresponding edges of the form (j,ti). If the loop (t,t) is in D then M matches the 
replacement edge (ii,t2). Thus it is easy to see that a maximum cardinality DCS corresponds to 
a maximum cardinality matching of the same size. The cardinality matching algorithm of Micali 
and Vazirani |50[ [33] gives the desired time bound. (This approach works for the versions of the 
problem that we will consider on unit jobs; it does not work when the jobs have longer length since 
a pair of edges (j,ti), (j,^) might get matched.) 

A second approach is to use an algorithm for maximum cardinality DCS on general graphs. If 
such an algorithm does not handle loops directly, modify G as follows by replacing each loop (t, t) 
by a triangle (t,t\,t2,t), where each is a new vertex with degree constraint d(ti) = 1. A DCS 
in G corresponds to a DCS in the new graph that contains exactly \T\ more edges, one from each 
triangle. The cardinality algorithm of Gabow and Tarjan [32] gives the desired time bound. 

Now let i* be the greatest number of jobs that can be scheduled. Clearly t* can be computed in 
time 0(y/nm) by finding a maximum cardinality DCS on the bipartite graph formed by removing 
all loops from G [20J. 

Theorem 7 A schedule for the greatest possible number of jobs (i* ) that minimizes the number of 
active slots can be found in time 0{\fnm). 

Proof: The algorithm first finds a DCS Dq of i* jobs, as described above. It converts Dq to the 
desired DCS by using an algorithm to find a maximum cardinality DCS on G, using Dq as the 
initial DCS. 

The correctness of this approach follows from the fact that the DCS algorithm works by aug- 
menting paths. This implies that as the initial DCS Dq is enlarged to the final DCS, no vertex's 
degree decreases. Thus the final solution schedules the same jobs as Dq. Now apply Lemma HJ 

Using Dq as the initial DCS does not affect the time bound, since the algorithm begins by finding 
augmenting paths of length 1, i.e., an arbitrary set of edges satisfying the degree constraints [32]. [50] . 

□ 

The proof shows that the choice of i* jobs is irrelevant - the minimum number of active slots 
for a schedule of l* jobs can be achieved using any set of t* jobs that can be scheduled. 

We also note that matching is a natural tool for our power minimization problem: Given a 
maximum cardinality matching problem, we create a job for every vertex and if two vertices are 
adjacent, we create a common time slot in which they can be scheduled. A maximum cardinality 
matching corresponds to a schedule with minimum active time. Thus if T(m) is the optimal possible 
time for our scheduling problem, a maximum cardinality matching on a graph of m edges can be 
found in 0(T(m)) time. For bipartite graphs the number of time slots can be reduced from m to 
n. Thus a maximum cardinality matching on a bipartite graph of n vertices and m edges can be 
found in 0(T(n,m)) time, for T(n, m) the optimal time to find a minimum active time schedule 
for 0(n) jobs, 0(n) time slots, and m pairs (J,i). 

Our algorithm can be extended to other versions of the power minimization problem. For 
example the following corollary models a situation where power is limited. 

Corollary 1 For any given integer a, a schedule for the greatest possible number of jobs using at 
most a active slots can be found in time 0{y/nm). 
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Proof: Start by constructing the DCS D* of Theorem [7J that schedules i* jobs and contains say A* 
loops. Let t* = T\ + 2t2, where D* schedules Tj time slots with i jobs, i = 1,2. Let a = ri + T2 — A. 
We consider the following cases for A. 

Case A < 0: D* has < a active slots. 

Case < A < t±: Choose any A time slots scheduling just one job and unschedule those jobs. 
This gives a schedule D with a active slots. As a DCS D has l* — A jobs and A* + A loops. (|4|) 
shows \D\ = \D*\. Since fixing the number of active slots fixes A, @ also shows D schedules the 
greatest possible number of jobs. 

Case T\ < A: In D* , unschedule the jobs that are scheduled in the t\ time slots with one job, or 
scheduled in A — t\ other time slots (chosen arbitrarily from the ti slots with two jobs). The result 
is a schedule D with a active slots, each processing two jobs. D obviously schedules the greatest 
possible number of jobs. □ 

The proof shows that the DCS D* is easily turned into a table (of t\ + T2 entries) that gives 
the desired schedule for every value of a. 

Now we highlight the connection to capacitated facility location in which capacites are two. Create 
a weighted graph MG W , where jobs correspond to clients and a facility corresponds to a slot, which 
is a pair of nodes (^1,^2) m the graph. Add edges (j,t\) and (j, £2) with weights equivalent to 
the cost of connecting client j to facility t. Also put an edge between every pair of slot nodes t\ 
and t2, with weight —C(t) where C(t) is the cost of opening facility t. Then, finding a minimum 
cost solution to the facility location problem amounts to determining a minimum cost matching in 
MG W in which each job is scheduled, since the former cost is simply the latter plus an additive 
term ^\ C (t) . The matching can be achieved by generalizing our algorithm to the weighted case 
in a natural way. 



6 Preemptive Scheduling for Integral Length Jobs 

This section discusses preemptive scheduling (on the integer time points only) for B = 2. Now each 
job j has an arbitrary integral length £(j), and a set Tj of time slots in which one unit of its length 
can be executed. We wish to assign each job j to exactly £(j) of these time slots, again minimizing 
the number of active time slots. We state several results for this model and sketch their proofs. 

Theorem 8 If a schedule executing every job to completion exists, such a schedule minimizing the 
number of active time slots can be found in time 0(y/~Lm) for L = ^2j£(j)- 

Proof Sketch: The algorithm is similar to Theorem [71 We use the graph G, modified so job j has 
degree constraint d(j) < £(j). Any DCS scheduling every job satisfies (|4|) for 1 = L. The rest of 
the argument is unchanged. The time bound follows from [32]. □ 

Now suppose we cannot schedule all the jobs to completion. Then it is NP-complete to even 
schedule the greatest possible number of jobs (for any fixed number of processors). We show 
the proof for B = 1, the extension for higher B is trivial. The reduction is similar to the NP- 
completeness proof in Subsection 3.1 and is as follows: We create a time slot for each element in 
X. We create a collection of m jobs (one corresponding to each set) of length 3 each. Each job can 
be scheduled in the time slots that correspond to the elements in the corresponding subset. We 
can schedule ^ jobs if and only if there is a solution to the 3 Exact Cover problem. 
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The previous result assumes that £(j) = 3 for all jobs j. For B = 2, the special case where each 
£(j) is 1 or 2 can be modeled by a graph G which is G augmented by a loop for every job j of 
length 2; also each such j has degree constraint d{j) = 2. The following results reduce the problem 
to maximum weight DCS. 

Theorem 9 (a) A schedule for the greatest possible number of jobs, minimizing the number of 

active slots, can be found in time 0{\J na{m, m) log 3 m m). 

(b) For every l\, a schedule of l\ unit jobs plus the greatest possible number of length two jobs, 
minimizing the number of active slots, can be found in the time bound of (a). 

(c) The collection of all schedules of (b) (i.e., a schedule for every possible value of i\) can be 
found in time 0(n(m + n'logn')), for n' the number of jobs and time slots. 

Proof Sketch: (a) Assign edge weights to G: An edge (j, t) weighs m if j is unit and m/2 if j has 
length 2. A loop weighs 1 if it is incident to a time slot and if it is incident to a job. A maximum 
weight DCS maximizes the number of jobs, and subject to that, maximizes the number of idle time 

slots. The algorithm of [32] finds a maximum weight DCS in time 0{ \J na{m, n) log 3 n m) on a 
graph of n vertices, m edges, and integral weights polynomial in n. 

(b) Add a vertex x adjacent to every unit job. Each new edge weighs and d(x), the degree 
constraint of x. is the number of unit jobs decreased by t±. 

(c) In the graph of (b), set d(x) = and find a maximum weight DCS. Then repeatedly increase 
d(x) by 1 and update the maximum weight DCS. Each update amounts to finding one augmenting 
path. The algorithm of [31] finds an augmenting path in time 0(m + nlogn)) on a graph of n 
vertices and m edges. □ 

Part (c) is motivated by the fact that unit jobs and length 2 jobs may differ in value. In the 
same vein, part (b) obviously generalizes to schedules that maximize an arbitrary linear function 
of the number of unit and length 2 jobs. 

7 Active Time and Arbitrary Preemption 

This section considers the case of scheduling a collection of jobs with arbitrary non- negative lengths 
lj and arbitrary sets Tj of feasible intervals, j G J. There are B processors that operate in a 
collection of unit length time slots s G S. Preemptions are allowed at any time, i.e., job j must be 
scheduled in lj units of time but it can be be started and stopped arbitrarily many times, perhaps 
switching processors - the only constraint is that it must never execute on more than one processor 
at any instant of time. We seek a schedule minimizing the active time. 

Note that when we allow preemption the multi-slot jobs of Section [5] correspond to arbitrary 
length jobs. Similarly a special case of this model is preemptively scheduling jobs of arbitrary 
length with integral release times and deadlines, i.e., the generalization of Section [2] which treats 
nonpreemptive unit jobs. 

7.1 Linear program formulation 

The problem can be formulated as a linear program. Form a graph of edges E where job j and slot 
s have an edge js when j can be scheduled in slot s. A variable Xj s gives the amount of time job 
j is scheduled in slot s, and a variable i s gives the amount of idle (inactive) time in slot s. The 
problem is equivalent to the following LP: 
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maximize X^gs *s 

Subject to ^2js€E X js > tj j£J (0 a ) 

E js eE x is + Bi s < B s £ S (0b) (5) 

Xj s + i s < 1 js £ E (|5jc) 

i a , Xj s > s £ S, js £ E 



To see the formulation is correct first observe that although (0 a) is an inequality we can assume 
equality holds, since we need only schedule £j units of job j. 

Secondly, observe that the inequalities (0b)-([5jc) are necessary and sufficient conditions for 
scheduling xj s units of job j, js £ E, in 1 — i s units of time (on B processors). Necessity is clear. 
For sufficiency, order the jobs arbitrarily; order the processors arbitrarily too. Repeatedly schedule 
the next 1 — i s units of jobs on the next processor, until all jobs are scheduled. The last processor 
may not receive a full 1 — i s units of work. Also a job may be split across processors. However such 
a job is scheduled last on one processor and first on the next, so (J5jc) guarantees it is not executed 
simultaneously on two processors. 

We note that the following LP is equivalent to ([5]). Partition 5 into disjoint intervals [a, b) where 
a and b are consecutive integers in the sorted set of interval boundaries over all jobs. Denote these 
intervals I±, . . . , Ik and let E' be the set of edges ij where job j is feasible in Ij. 



maximize Yli=i Vi 

subject to Ysij^E' x ij — 3 ^ J (0 a ) 

EijeE'Xij + B-Vi < B-\Ii\ i = l...k (JBJb) (6) 

Xij +yi < \Ii\ ij £ E' dUc) 

Vi, > i = l...k,ij £ E' 



The variable yi denotes the amount of idle time in interval Ij. The variable x^ denotes the 
amount of time devoted to job j in interval Ij. For jobs of arbitrary length, the size of this LP is 
still polynomial in the number of jobs and the total number of intervals in the sets Tj. 

7.2 The case B = 2 and 2-matchings 

It is convenient to use a variant of ([5]) based on the matching graph MG of Section [5j Each edge of 
MG has a linear programming variable. Specifically each edge js £ E(G) gives rise to two variables 
x e , e = jsj, i £ {1, 2}, that give the amount of time job j is scheduled on processor i in time slot 
s. Also each time slot s £ S, has a variable x e , e = s\S2 that gives the amount of inactive time in 
slot s. The problem is equivalent to the following LP: 



maximize ^{^e : e £ E(MG)} 

subject to ^{^e : e incident to j} = ij j £ J (0a) 

YA%e ■ e incident to s^ < 1 s £ S, i £ {1, 2} (0b) (7) 

E{xe-e£{js 1 ,js 2 ,s 1 s 2 }} < 1 js£E{G) (0c) 

x e > e £ E{MG) 

To see this formulation is correct note that any schedule supplies feasible x e values. Conversely 

any feasible x e 's give a feasible solution to LP (J5]) (specifically set xj s = Xj Sl +Xj S2 and i s = x slS2 ) 
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(a) (b) (Cj 

Figure 2: (a) We illustrate the graph MG with two time slots and three jobs with B = 2. Jobs A 
and B can be scheduled in time slots s and t, while job C can only be scheduled in time slot t. (b) 
We show a solution corresponding to an integral matching and (c) a solution corresponding to a 
triangle free 2-matching with \ unit of idle time in the first slot. 

and so corresponds to a schedule. Finally the objective of ([7]) equals the total of all job lengths 
plus the total inactive time, so like © it maximizes the total inactive time. 

Now consider an arbitrary graph G = (V, E) and the polyhedron denned by the following system 
of linear inequalities: 

^2{x e : e incident to v} < 1 v £ V ([Hla) 

Y,{x e ■ e an edge of T} < 1 T a triangle of G flSb) (8) 

x e > e G E 

Call J2i x e '■ e ^ E} the size of a solution to (|8|). Say vertex v is covered if equality holds in (|8la). 

Define a 2-matching M to be an assignment of weight 0,1 or 1/2 to each edge to each edge of 
G so that each vertex is incident to edges of total weight < lH M is basic if its edges of weight 1/2 
form a collection of (vertex-disjoint) odd cycles. The basic 2-matchings are precisely the vertices 
of the polyhedron determined by the constraints ([8]) with (JHJ b) omitted. A basic 2-matching is 
triangle-free if no triangle has positive weight on all three of its edges. Cornuejols and Pulleyblank 
|16j showed the triangle-free 2-matchings are precisely the vertices of the polyhedron determined 
by constraints ([8|). 

When all job lengths are one, the inequalities of ([7]) are system ([8]) for graph MG, with the 
further requirement that every job vertex is covered. So it is easy to see that the result of Cornuejols 
and Pulleyblank implies our scheduling problem is solved by any triangle-free 2-matching on MG 
that has maximum size subject to the constraint that it covers every job vertex. Also, interestingly, 
there is always solution where each job is scheduled either completely in one time slot or is split 
into two pieces of size 1/2 (see Fig. [2]). 

Cornuejols and Pulleyblank give two augmenting path algorithms for triangle-free 2-matching: 
they find such a matching that is perfect (i.e., every vertex is fully covered) in time 0(nm) [17] ; and 
such a matching that has minimum cost in time 0(n 2 m) [16]. (The latter bound clearly applies 
to our scheduling problem, since it can model the constraint that every job vertex is covered.) 
Babenko et. al. [5] showed that a maximum cardinality triangle-free 2-matching can be found in 
time 0(y / nm). This is done by reducing the problem to ordinary matching, with the help of the 
Edmonds-Gallai decomposition. 

3 This definition of a 2-matching scales the usual definition by a factor 1/2, i.e., the weights are usually 0,1 or 2. 
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Figure 3: Matching with four blossoms. 



Here we give two easy extensions of [5j: a simple application of the Mendelsohn-Dulmage 
Theorem [55] shows that a maximum cardinality triangle-free 2-matching can be found in the same 
asymptotic time as a maximum cardinality matching (e.g., the algebraic algorithm of Mucha and 
Sankowski [51] can be used) . This result extends to maximum cardinality triangle- free 2-matchings 
that are constrained to cover a given set of vertices (this is the variant needed for our scheduling 
problem). The development is based on standard data structures for blossoms. For instance, it 
is based on a simple definition of an augmenting cycle (analog of an augmenting path), leading 
to an augmenting blossom (which models the triangle cluster of Cornuejols and Pulleyblank) ; we 
show a maximum cardinality matching with the greatest number of augmenting blossoms gives a 
maximum cardinality cardinality triangle- free 2-matching (Lemma 4(i))o 

Our proof shows that the scheduling problem is solved correctly, independently of \16\ [T7\ [5]. 
We also extend the ideas to get similar results for arbitrary job lengths. 

7.3 Triangle-free 2-matchings 

This section gives an efficient algorithm to find a maximum cardinality triangle-free 2-matching. 

The approach to 2-matchings is through ordinary matchings. Fig. [3] illustrates the discussion. 
Wavy edges are matched. Square vertices are unmatched. 

All paths and cycles that we consider are simple. A path is alternating if its edges are alternately 
matched and unmatched; a cycle is alternating if one vertex is incident to two unmatched edges 
and otherwise the edges are alternately matched and unmatched. 

Consider a matching M on an arbitrary graph G. An augmenting cycle A is an alternating 
odd cycle of length > 3 that contains an unmatched vertex (i.e., a vertex ^ V(M)). Changing the 
weight of every edge of A to 1/2 gives a 2-matching of size \M\ + 1/2. We will show that a maximum 

4 Our algorithm was developed independently of the authoritative algorithm of [5]- 
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size triangle-free 2-matching, and a solution to our scheduling problem, can be constructed using 
augmenting cycles. 

We assume some familiarity with the blossom algorithm for maximum cardinality matching [13] 
but we review most notions. 

A blossom is a recursively defined subgraph B. The vertices of B are partitioned into an odd 
number of subgraphs Bi. Each Bi is either a single vertex or a blossom. If we contract each Bi to 
a vertex, the remaining edges of B form an alternating odd cycle spanning every Bi. Fig. [3] has 
four maximal blossoms B\-B±. 

Let Bq be the unique Bi that is incident to two unmatched edges of B. The base (vertex) b of 
B is Bq if Bq is a vertex, else it is the base of -Bo- An easy induction shows that every vertex of 
B — b is on a matched edge of B. If b is on a matched edge, that edge is incident to V(B). B is a 
matched (unmatched) blossom if b is on some (no) matched edge. 

Two well-known properties of blossoms can be shown by simple induction: 

PI: Let B be a blossom with base b. For any v G V(B), B contains an even length 
alternating path from v to b. 

P2: Let B be an unmatched blossom. The matching on E{B) can be modified to make 
any given vertex of B the base. 

A triangle cluster [T7] is is a connected subgraph whose biconnected components are all 
triangles. Note that a single vertex is a triangle cluster. Also, a subgraph is a triangle cluster 
exactly when it is connected, its edges can be partitioned into triangles, and any vertex shared by 
two or more triangles is a cutpoint. A blossom whose vertices induce a subgraph that is a triangle 
cluster is a t-blossom (B2-B4 in Fig. [3j); in the opposite case it is a non-t-blossom. A non-t-blossom 
is augmenting if it is unmatched (B\ in Figj3]). 

Lemma 2 The matching on an augmenting blossom B can be modified so V{B) is spanned by a 
set of matched edges and an augmenting cycle. 

Proof: Consider an arbitrary blossom A. Its recursive definition gives various subblossoms B, 
each composed of subgraphs Bi, i = 0, . . . , k, k even, along with edges UiVi + i, i = 0, . . . , k joining 
Bi to Bi+\. (Take k + 1 to be 0.) Consider one of these subblossoms B. 

Make uq the base of A (by P2). Let P be the even length alternating path in B from v\ to the 
base uq (by PI). Adding edge uo^i to P completes an alternating cycle C. A is spanned by C and 
the matched edges of A — C . In the first two cases below B can be chosen so |C| > 5, i.e., C is the 
desired augmenting cycle. 

Case 1: Some subblossom B has k > 2. Use B in the above procedure. Note that C goes through 
every Bi. (It starts in B±, so to reach uq E Bq it must go through B2, ■ ■ ■ ,Bk,BQ in that order.) 
Thus C goes through > 5 subblossoms and \C\ > 5. 

Case 2: Some subblossom B has some Ui 7^ Vi. C traverses a path in Bi from Vi to Ui. So it has 
> 2 vertices in Bi and > 1 vertex in every other subblossom, i.e., > 4 vertices. Since C has odd 
length, |C| > 5. 

Case 3: Every subblossom B has k = 2 as well as Ui = Vi for every i. Every subblossom consists 
of three edges forming a triangle. So the edges of A form a triangle cluster. 

Suppose A is an augmenting blossom. So A does not induce a triangle cluster, i.e., some edge 
uv E E(G) — E(A) joins two vertices of A. Make u the base of A. Let P be the even length 
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Figure 4: Hungarian subgraph for the graph in Fig. [3j 



alternating path in A from v to u (by PI). Adding edge uv to P completes an alternating cycle C . 
A is spanned by C and the matched edges of A — C . C is not a triangle since it contains edge uv 
not in the triangle cluster. Thus \C\ > 5. □ 

We use some more concepts from ordinary matching [IS] • An alternating tree has an unmatched 
root, every path from the root is alternating, and every leaf has even distance from the root. A 
vertex at even (odd) distance from the root is outer [inner). 

Fig. H] illustrates the following definition (for the matching of Fig. [3]): Let M be a maximum 
cardinality matching on G. A Hungarian subgraph H for M has nodes that are either vertices of 
G or contractions of blossoms in G. No vertex of G occurs more than once as a node of H or as 
a member of a blossom. Every unmatched vertex of G is either a node of H or the base of an 
unmatched blossom of H. H is spanned by a forest of alternating trees. Each blossom B of H is 
outer in this forest. Call a vertex of G outer if it is an outer node of H or in some blossom of H, 
and inner if it is an inner node. We sometimes write Vh to denote the set of all inner and outer 
vertices of G. The key property is: 

(*) Any edge vw of G with v an outer vertex has w inner or v and w in the same blossom. 

In Fig. H] (as well Fig. [3]) inner vertices are hollow, outer are filled. The three dashed edges are 
non-tree edges. 

We use a slight extension of this concept: Take G, M, H as before. Let B be the set of augmenting 
blossoms in H. (We shall see below that B does not depend on choice of Hungarian subgraph H.) 
Let T(M) be the triangle-free 2-matching of size \M\ + \B\/2 formed by using Lemma [2] on each 
blossom of B and enlarging M in the obvious way. A reduced Hungarian subgraph H for T{M) is a 
Hungarian subgraph on G — U{^(-^) ■ B £ B} ioi M restricted to this graph. We reiterate that a 
vertex of an augmenting blossom does not belong to Vh- The reduced Hungarian subgraph for the 
matching of Fig. [3] is Fig. |4]with B\ deleted, i.e., the leftmost dashed edge becomes a tree edge. 

Lemma 3 A reduced Hungarian subgraph for T(M) satisfies (*). 

Proof: For (*) to fail an edge must join an outer vertex v to an augmenting blossom B. This edge 
completes an augmenting path for M, from the unmatched vertex of B to the root of v's alternating 
tree. But this contradicts the maximum cardinality of M. □ 

This section uses the notion from matching theory of "set cover" (as in odd set covers, reviewed 
below). A set cover C for an arbitrary graph G is a subpartition of the vertices such that every edge 
of G either has both ends in the same set of C or at least one end in a singleton set of C. We use 
several types of these covers, each with its own definition of "capacity" of a set. The intent is that 
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the capacity C (i.e., the total capacity of all its sets) should upper bound the size of a matching of 
some type. 

In a triangulated set cover C, the capacity of a set S £ C is 1 for a singleton, L|5|/2j if E(S) 
is a triangle cluster and |S*|/2 otherwise. The total weight of any solution to LP (jSJ) is at most 
the capacity of any such cover C. In proof a singleton is incident to edges of total weight < 1 (its 
capacity). Let S C V be a non-singleton. The total weight of all edges with both ends in S is 
< |<S|/2. Finally suppose S induces a triangle cluster of t triangles. S has 2t + 1 vertices, and the 
edges with both ends in S weigh at most t = [\S\/2\ . 

The 2-matchings we wish to construct are characterized as follows: 

Lemma 4 (i) If M is a maximum cardinality matching with the greatest possible number of 
augmenting blossoms in its Hungarian subgraph, then T(M) is a maximum size triangle-free 2- 
matching. In fact it is a maximum size solution to |2|). 

(ii ) Let J be a set of vertices that can be covered by a triangle-free 2-matching. Let M be 
a maximum cardinality matching with the greatest possible number of augmenting blossoms in its 
Hungarian subgraph, subject to the constraint that every vertex of J is matched or in an augmenting 
blossom. If such an M exists, T(M) is a maximum size triangle-free 2-matching that covers J. In 
fact it is a maximum size solution to (E|) when every vertex of J is constrained to have equality in 

ma). 

Proof: (i) Let H be a reduced Hungarian subgraph for T(M). Let B be any blossom in H. A 
maximum cardinality matching with B unmatched can be formed (by interchanging the matched 
and unmatched edges in the path from B to the root of its alternating tree). This makes B an 
augmenting blossom if it is a non-t-blossom. So the choice of M makes B a t-blossom. 
Consider the family of sets 

c = [v-v H , W, V(B) : v an inner vertex, B a blossom of Hj. 

C is a set cover (property (*) shows edges with an outer end are handled correctly). The size of 
T(M) equals the capacity of C considered a triangulated set cover (since no edge of positive weight 
leaves V — Vh, any inner vertex is on a weight 1 edge leading to an outer vertex, and the number 
of weight 1 edges with both ends in a blossom B of H is |_| V(.B) |/2j , which equals the capacity of 
V(B) as a triangle cluster). Part (i) follows. 

(ii) T(M) covers J since it covers every vertex in an augmenting blossom of M. The rest of the 
argument is identical to part (i). □ 

We turn to the algorithm to find a maximum cardinality triangle-free 2-matching. First recall 
two more ideas from matching theory. An odd set cover C for an arbitrary graph G is set cover (as 
defined above) where the capacity of a set S G C is 1 for a singleton, else U51/2J [H2 09]. 

It is easy to see that the cardinality of any matching is at most the capacity of any odd set cover. 
Furthermore equality can always be achieved: Take any Hungarian subgraph H for any maximum 
cardinality matching M. An argument similar to the previous one shows that \M\ equals the 
capacity of the odd set cover 

(v — Vh, {v}, V(B) : v an inner vertex, B a blossom |. 

Next we recall the Edmonds-Gallai decomposition, which gives the structure of any maximum 
cardinality matching. (Parenthetic remarks will sketch proofs that this decomposition is correct. 
Let C be an odd set cover whose capacity is the size of a maximum cardinality matching.) Call 
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a matching perfect on S C V if every vertex of S is matched with another vertex of S, and near- 
perfect if this holds for all but one vertex of S. (So IS"! is even in the first case and odd in the 
second.) 

Start with a Hungarian subgraph H for some maximum cardinality matching. (For our proofs, 
let C be the odd set cover constructed from H as described above.) We will describe the structure 
of an arbitrary maximum cardinality matching M. 

(a) M contains a perfect matching of the vertices not in Vh- (This follows since V — Vr 6 C.) 

(b) M contains a near-perfect matching of any blossom B of H. (This follows since B £ C.) 

(c) M contains a maximum cardinality matching of the nodes of H. (This follows from the 
maximum cardinality of M and (a)-(b).) In fact this matching is a maximum cardinality 
matching of the bipartite graph BG whose edge sets are the inner nodes, and the outer 
nodes of H and whose edges are the edges of H that join inner node to outer. (This follows 
since C shows M does not contain any edge joining two inner vertices.) In FigU] BG does 
not contain the horizontal dashed edge. 

We remark that the matching of (c) covers every inner vertex (again by C). Note that each 
edge of BG can be identified with one or more edges of G. Once this edge is chosen the matching 
of (c) determines the unmatched vertices in the near-perfect matchings of (b). So starting from 
any maximum cardinality matching BM of BG, we can construct a corresponding matching of G 
satisfying (a)-(c), i.e., a maximum cardinality matching of G. 

The last part of Edmonds-Gallai states that the sets V(B) for blossoms B of H are an invariant 
of G. (In proof, property (*) implies the outer vertices of H are precisely those vertices that can 
be reached from an unmatched vertex (of M) by an even length alternating path. Thus the outer 
vertices constitute the set U of vertices that are unmatched in some maximum cardinality matching. 
The sets V(B) are the non-singleton connected components of U.) 

In the rest of this section, O denotes the set of outer nodes in the above bipartite graph BG, 
and N C O denotes its set of non-t-blossoms. 

Lemma 5 (i) Let BM be a maximum cardinality matching of BG that matches the greatest possible 
number of nodes of O — N . Then T(M) is a maximum size triangle-free 2-matching if M is a 
maximum cardinality matching of G corresponding to BM (as described above). 

(ii) Let J be a set of vertices that can be covered by a triangle-free 2-matching. Let BM be a 
maximum cardinality matching of BG that matches every node of O — N that is a subset of J, and 
as many other nodes of O — N as possible. Then T{M) is a maximum size triangle-free 2-matching 
that covers J if M is a maximum cardinality matching of G corresponding to BM such that no 
base vertex of an unmatched blossom of O — N belongs to J. 

Proof: (i) Let BM* be a matching on BG that corresponds to a maximum cardinality matching 
of G having the greatest possible number of augmenting blossoms. The invariance of blossoms 
implies that the augmenting blossoms of BM* are precisely the blossoms of N that are unmatched 
in BM*. By Lemma H] we need only show that BM has as many augmenting blossoms as BM*. 

The size of a matching on BG is the number of matched outer nodes, so \0 n V(BM)\ = 
\0 n V(BM*)\. By definition \(0 - N) D V(BM)\ > \(0 - N) D V{BM*)\. So \N n V(BM)\ < 
\N n V(BM*)\. This implies the desired inequality \N - V(BM)\ > \N - V{BM*)\. 

(ii) Suppose 

(a) the matching BM of part (ii) exists, and 

(b) matching M of part (ii) exists and covers every vertex of J. 
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Then part (ii) follows using the argument for part (i). 

To establish (a)-(b), let Jo denote the set of nodes of O — N that are subsets of J. 

Claim BG has a matching BMq that covers every node of Jo. 

The claim implies (a). To show it implies (b), Edmonds-Gallai shows any unmatched node of 
BM is outer. The claim shows any blossom of O — N that is unmatched in BM contains a vertex 
v ^ J. v can be made the unique unmatched vertex of B (P2). So M covers J — N, as does T(M). 
T(M) covers every vertex in a blossom of N. (b) follows. 

To prove the claim let M 2 be a triangle-free 2-matching that covers J, viewed as a collection 
of matched edges and odd cycles. We will construct a set of edges S C M 2 such that each outer 
node covered by M 2 has degree > 1 in S, and each inner node has degree < 1. Clearly S contains 
the desired matching BMq (BMq is a subset of BG by property (*)). 

We construct S by traversing the connected components of M 2 , maintaining the invariant that 
any outer node covered by a traversed edge of M 2 has degree > 1 in S and any inner node has 
degree < 1 in S. We traverse a component C of M 2 as follows. 

Suppose C is a matched edge. Add it to S. Clearly this preserves the invariant. 

Suppose C is an odd cycle. The edges of C n BG form a number of connected components. 
Traverse each such component, adding alternate edges to S. An outer node x in such a component 
is on > 1 edge added to S, since C leaves x each time it enters it, and property (*). An inner node 
x maintains degree < 1 in S, since x is on just 2 edges of C (each of which may or may not be in 
BG). The claim follows. □ 

In summary the algorithm to find a maximum cardinality triangle-free 2-matching works as 
follows. Find a maximum cardinality matching Mq of G and its Hungarian subgraph H. Use 
H to construct BG. Find a maximum cardinality matching M\ of BG — N . Find a maximum 
cardinality matching Mi of BG that covers all nodes covered by M\. Extend M2 to a maximum 
cardinality matching M3 of G, using (a)-(c). Convert M3 to the 2-matching T(M^) by reweighting 
the augmenting blossoms. Return T(M^). 

If the 2-matching is required to cover a set of vertices J two simple changes are needed: Take 
Mi as a maximum cardinality matching of BG — N subject to the constraint that it matches every 
node of O — N that is a subset of J. Take M3 so it leaves a vertex not in J unmatched in each 
unmatched blossom of O — N. 

Theorem 10 A maximum cardinality triangle-free 2-matching can be found in time 0(y/nm). The 
same holds if the 2-matching is constrained to cover a set of vertices J, assuming J can be covered 
by some triangle-free 2-matching. 

Proof: The maximum cardinality matchings are found in time 0(y/nm) \50\ 133] . A Hungarian 
subgraph for a given maximum cardinality matching can be found in time 0(m) \27\ I33|. Hence 
the total time for both of our algorithms is 0(\fnm). □ 

Corollary 2 A maximum cardinality triangle-free 2-matching can be found in the same asymptotic 
time as a maximum cardinality matching. The same holds if the 2-matching is constrained to cover 
a set of vertices J. 

Proof: The argument for a 2-matching constrained to cover J is essentially the same as the 
unconstrained case, so we discuss only the latter. We show that excluding the time for maximum 
cardinality matching, our algorithm uses time 0(m). 
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As just noted H is constructed in 0(m) time in \27\ 133] . This construction also supplies the 
recursive decomposition of each blossom B into subblossoms B{, so it is easy to find the near-perfect 
matchings of (b) in Edmonds-Gallai (for M3). It is also easy to classify each blossom of H as a 
triangle cluster or a non-t-blossom. 

It only remains to describe how to find the maximum cardinality matching M2 of BG that covers 
all nodes covered by M\. The following well-known procedure works for an arbitrary matching M\ 
on an arbitrary graph G. 

Start by finding any maximum cardinality matching M2 of G. The edges of M\ U M2 form a 
number of connected components, each of which is a path or cycle with edges alternating between 
the two matchings. A node x G V{M\) — V{M%) is the end of a component that is an alternating 
path P. P has even length and its other end y is in V{M%) — V{M\) (since M2 has maximum 
cardinality) . So replacing the edges of P D M2 by P n M\ makes x covered and keeps M2 maximum 
cardinality, {y becomes uncovered but this is not a problem.) Doing this for all such x makes M2 
the desired maximum cardinality matching. Clearly the entire construction uses time 0(n). □ 

It is well-known that the problems of maximum size 2-matching and maximum cardinality 
bipartite matching have the same asymptotic time bound. It is also clear that finding a maximum 
size triangle-free 2-matching is at least as hard as maximum cardinality bipartite matching. 

7.4 Applications to preemptive scheduling for B = 2 

The algorithm of Theorem [TUl along with the LP ([7|) , solves the preemptive scheduling problem for 
unit jobs. We begin by extending the solution to arbitrary job lengths £j. 

We reduce the general problem to the unit length case, using a graph U G with unit jobs defined 
as follows. As before each time slot s is represented by vertices s\,S2 and edge siS2- A job j of 
length £j, which may be scheduled in a set of slots Sj, is represented by vertices Uj s , s G Sj, each 
with two edges Uj s Si, i = 1,2. In addition job j has vertices Uji, i = 1, . . . , \Sj\ — lj. with a complete 
bipartite graph joining its two types of vertices Uj s and Uji. 

The triangles of UG have the form Uj s , s±, S2 and are similar to those of MG. Define the sets 
of vertices Jjj = {uj s : j £ J, s G Sj}, Jjj = {uji : j G J, i < \Sj\ — £j}. It is easy to see that a 
solution to LP (|8j) on graph U G that covers every vertex of Jjj U Jjj gives a feasible solution to LP 
([7]) and vice versa. So as before [17] (or Lemma HJii)) shows the solution to our problem is given 
by a triangle-free 2-matching on U G that covers Jjj U Jjj and has the greatest cardinality possible. 

Theorem 11 Let J be a set of unit jobs that can be scheduled on B = 2 processors. A preemptive 
schedule for J minimizing the active time can be found in 0{^Jnm) time. The result extends to 
jobs of arbitrary integral lengths £j, where the time is OiyLrn) for L the sum of all job lengths. 

Proof: The algorithm for triangle-free 2-matching (Theorem flOl) can be implemented in time 
0(y/~Lm) on graph UG. To do this we use the vertex substitution technique of [26] which works 
on graphs of 0(m) edges rather than UG itself. □ 

A standard construction from network flow shows that regarding feasibility, allowing arbitrary 
preemption does not help in this sense: For any number of processors, a set of jobs that can be 
feasibly scheduled with arbitrary preemption can also be scheduled limiting preemption to integral 
time points, assuming the jobs are either all unit length, or they have arbitrary integral lengths lj 
and time is slotted. (In detail use a flow graph where job j is a source of capacity £j, slot t is a 
sink of capacity p, and edge jt exists when j can be scheduled in slot t. The Integrality Theorem 
shows we can assume each j gets scheduled in exactly lj time slots.) 
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However, even in those situations, arbitrary preemption can reduce active time. For example 
when B = 2, three unit jobs that may be scheduled in slots 1 or 2 require 2 units of active time 
using preemption at integer times and only 3/2 units with arbitrary preemption. We now show 
this example gives the greatest disparity between the two preemption models for B = 2. 

Recall that we construct an optimal preemptive schedule V by finding a a maximum size triangle- 
free 2-matching as specified in Lemma H^ii) (on MG for unit jobs, and UG for arbitrary length 
jobs) and converting it to V in the obvious way via LP ([7]). Part (Hi) of the lemma below gives 
the main property for establishing the desired bound. The following lemma is proved by examining 
the structure of blossoms in our special graphs, e.g., the triangles all have the form js\S2 for j a 
job vertex and s\S2 the edge representing a time slot; also vertices s± and S2 are isomorphic. 

Lemma 6 (i) A blossom in MG (UG) is a triangle cluster iff it contains exactly one vertex of J 
(Jij), respectively. 

(ii ) Let H be a Hungarian subgraph in MG or UG. Any slot s either has both its vertices inner, 
or both its vertices outer (and in the same blossom) or neither vertex in Vh- 

(Hi) Let the optimal preemptive schedule V be constructed from T(M), and let B be an aug- 
menting blossom of M . The time slots with both vertices in B have > 3/2 units of active time in 
V. 

Proof: (%) A triangle T in MG or UG must contain a slot edge s±S2- So the other two edges must 
be jsi,js2 for vertex j £ J U Jjj. A triangle T' that shares a vertex but no edge with T must 
contain vertex j. So the triangle clusters are sets of edges js\,js2, S1S2 where j is a fixed vertex of 
JU Jjj. (In UG a cluster has only one triangle.) 

Clearly this implies any t-blossom has exactly one vertex of JU Jjj. 

Conversely let B be a non-t-blossom. Make B an augmenting blossom by unmatching any edge 
incident to its base. Then Lemma [2] shows B has an alternating cycle C of > 5 edges. C must 
contain an edge of the form s\S2- The two other edges of C incident to s\ and S2 must go to vertices 
of J U Jjj that are distinct. 

(ii) First observe that s\ £ Vh implies S2 £ Vh- If si is outer this follows from edge s±S2- If si 
is inner then its parent contains an outer vertex j £ JU Jjj adjacent to s±. j is also adjacent to S2 
so s 2 £ Vh- 

We complete the proof by showing that if s± is outer then so is S2- If, on the contrary, S2 is 
inner then it is adjacent to an outer vertex in its parent and an outer vertex in its child. These 
vertices are obviously in different blossoms. But s± is adjacent to both of them, so both are in the 
blossom containing s±, contradiction. 

(Hi) First consider the case of unit jobs and graph U G. Any blossom B in a Hungarian subgraph 
contains an even number of slot vertices s\,S2 (part (H)) and an odd number of vertices in total. 
So B contains an odd number of J-nodes. Suppose B is augmenting. Part (i) now implies B has 
> 3 J-nodes. V schedules the jobs of B in the time slots of B. Clearly they use > 3/2 units of 
active time. 

Next consider arbitrary length jobs and graph MG. Let B be an augmenting blossom in U G. 
Lemma [2] shows B has an alternating cycle C of > 5 edges. 

Since \C\ is odd, C contains an odd number of edges of the form siS2- Each such Sj is adjacent 
in C to a J(/-vertex. The corresponding edge has weight 1/2 in T(M), so slot s has (exactly) 1/2 
unit of active time in V . If C has > 3 such edges s\S2 then the slots of B have > 3/2 units of active 
time in V . So suppose C contains exactly one s\S2 edge. 

The path in C that avoids s\S2 must contain some slot vertex t±, t € S (recall the definition of 
MG). Since t\t2 ^ C, t\ is adjacent to two ./^-vertices in C. The corresponding edges have weight 
1/2 in T(M). So slot t has 1 unit of active time and s has 1/2, giving the desired 3/2 units. □ 
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Theorem 12 For B = 2 and a set of jobs J, the minimum active time in a schedule permitting 
preemption only at integer times is < 4/3 times that of one allowing arbitrary preemptions. 

Remark: The theorem holds for unit jobs, and for jobs of arbitrary length £j when time is slotted, 
i.e., a schedule with preemptions only at integer times is allowed to execute a length £j job in lj 
distinct time slots, but no more. If time is continuous the ratio of the theorem can approach the 
trivial bound of B = 2 when job lengths are arbitrary. For example consider a length t job with 
release time and deadline I 2 , and unit jobs j = 0, ...,£ — 1 with release time £j and deadline 
£j + 1. The ratio (2£ — l)/£ approaches 2. 

Proof: An augmenting blossom increases the size of the 2-matching by 1/2. Recalling the objective 
function of LP ([7]) we see this increases the number of inactive time units by 1/2, i.e., it decreases 
the number of active time units by 1/2. 

Our optimal preemptive schedule V is constructed from T(M), where the matching M corre- 
sponds to an optimal schedule M with preemptions limited to integer times (recall Lemma [1]) . Let 
M have it augmenting blossoms and let V have a active time units. So Af has a + ir/2 active time 
units. 

The preceding lemma (part (Hi)) shows a > (3/2)-7r. Thus the number of active time units in 
M exceeds the number in V by a factor 

(a + vr/2)/a = 1 + (vr/2)/a < 4/3. 

□ 

8 Conclusion 

In this paper, we defined a new problem which involves scheduling jobs in batches of size at most 
B. Each job has periods of time within which it must be scheduled and the goal is to minimize 
the number of active time slots in the schedule. No cost is incurred for slots in which no jobs are 
scheduled. There is a strong connection between this problem and other classic covering problems 
such as vertex cover with hard capacities and the if-center problem. Another general model of 
energy consumption might allow for the energy consumption of each active slot to depend on the 
number of jobs actually assigned to that time slot. At least for the B = 2 case this can be handled 
easily, by adapting the matching based solution described in Section [5j 

One could generalize this model further to include other objective functions that measure com- 
pletion times, tardyness, etc. Furthermore, it would be interesting to consider the online setting in 
which the entire set of jobs is not known in advance, but jobs arrive over time and are known only 
when they are released, or perhaps shortly before they are released. 
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